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ABSTRACT 



The invention relates to genetically engineered soluble fusion proteins consisting of 
human proteins or parts thereof not belonging to the immunoglobulin family and 
various portions of the constant region of immunoglobuHn molecules. The functional 
properties of both fusion components are surprisingly retained in the fusion protein. 
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and 

The General Hospital Corporation 



Description 

Fusion proteins with InBamoglobulin portions, the 
preparation and use tliereof 



The invention relates to genetically engineered soluble 
fusion proteins composed of hiiman proteins not belonging 
to the immunoglobulin family, or of parts thereof, and of 
various portions of the constant region of immunoglobulin 
molecules . The functional properties of the two fusion 
partners are, surprisingly, retained in the fusion 
protein . 

EP-A 0 325 262 and EP-A 0 314 317 disclose corresponding 
fusion proteins composed of various domains of the CD4 
membrane protein of human T cells and of human IgGl 
portions. Some of these fusion proteins bind with the 
same affinity to the glycoprotein gpl20 of human immuno- 
deficiency virus as the cell-bound CD4 molecule. The CD4 
molecule belongs to the immunoglobulin family and, 
consequently, has a very similar tertiary structure to 
that of immunoglobulin molecules. This also applies to 
the a chain of the T-cell antigen receptor, for which 
such fusions have also been described (Gascoigne et al., 
Proc. Natl. Acad. Sci. USA, vol. 84 (1987), 2937-2940). 
Hence, on the basis of the very similar domain structure, 
in this case retention of the biological activity of the 
two fusion partners in the fusion protein was to be 
expected . 

The human proteins which are, according to the invention, 
preferably coupled to the amino terminus of the constant 
region of immunoglobulin do not belong to the immuno- 
globulin family and are to be assigned to the following 
classes: (i) membrane-bound proteins whose extracellular 
domain is wholly or partly incorporated in the fusion. 
These are, in particular, thromboplastin and cytokine 
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receptors and growth factor receptors, such as the 
cellular receptors for interleukin-4 , interleukin-7 , 
tumor necrosis factor, GM-CSF, G-CSF, erythropoietin; 
(ii) non-membrane-bound soluble proteins which are wholly 
5 or partly incorporated in the fusion. These are, in 

particularly, proteins of therapeutic interest such as, 
for example, erythropoietin and other cytokines and 
growth factors . 



The fusion proteins can be prepared in known pro- and 
10 eukaryotic expression systems, but preferably in m^lmmal- 

ian cells (for example CHO, COS and BHK cells). 



The fusion proteins according to the invention are, by 
reason of their immunoglobulin portion, easy to purify by 
affinity chromatography and have improved pharmacokinetic 
15 properties in vivo. 

In many cases ^ the Fc part in fusion protein is 
thoroughly advantageous for use in therapy and diagnosis 
and thus results, for example, in improved pharma- 
cokinetic properties (EP-A 0232 262). On the other hand, 

20 for some uses it would be desirable to be able to delete 

the Fc part after the fusion protein has been expressed, 
detected and purified in the advantageous manner 
described. This is the case when the Fc portion proves to 
be a hindrance to use in therapy and diagnosis, for 

25 example when the fusion protein is to be used as antigen 

for immunizations. 



There are in existence various proteases whose use for 
this purpose appears conceivable. Papain and pepsin are 
employed, for example, to generate F(ab) fragments from 

30 immunoglobulins ( Immunology, ed. Roitt , I . et al . , Gower 

Medical Publishing, London (1989)), but they do not 
cleave in a particularly specific manner. Blood coagula- 
tion factor Xa by contrast recognises in a protein the 
relatively rare tetrapeptide sequence Ile-Glu-Gly-Arg and 

35 performs a hydrolytic cleavage of the protein after the 



arginine residue . Sequences which contain the 
described tetrapeptide were introduced first by Nagai and 
Thogersen in a hybrid protein by genetic engineering 
means (Nagai, K. and Thogersen, H.C-, Nature, vol. 309 
(1984), 810-812). These authors were able to show that 
the proteins expressed in E. coli actually are specifi- 
cally cleaved by factor Xa. However, there is as yet no 
published example of the possibility of such proteins 
also being expressed in eukaryotic and, especially, in 
animal cells and, after their purification, being cleaved 
by factor Xa • However, expression of the proteins 
according to the invention in animal cells is preferable 
because only in a cell system of this type is there 
expected to be secretion of, for example, normally 
membrane-bound receptors as fusion partners with 
retention of their natural structure and thus of their 
biological activity. Secretion into the ceil culture 
supernatant facilitates the subsecjuent straightforward 
purification of the fusion protein. 

The invention thus relates to genetically engineered 
soluble fusion proteins composed of human proteins not 
belonging to the immunoglobulin family, or of parts 
thereof, and of various portions of the constant regions 
of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as immuno- 
globulin is the constant part of the heavy chain of human 
IgG, particularly preferably of human IgGl, where fusion 
takes place at the hinge region. In a particular embodi- 
ment , the Fc part can be removed in a simple way by a 
cleavage sequence which is also incorporated and can be 
cleaved with factor Xa . 

Furthermore, the invention relates to processes for the 
preparation of these fusion proteins by genetic engineer- 
ing, and to the use thereof for diagnosis and therapy. 

Finally, the invention is explained in further examples. 
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Example 1: Thxcsmboplastin fusion piroteinB 

Blood coagulation is a process of central importance in 
the human body. There is appropriately delicate regula- 
tion of the coagulation cascade, in which a large number 
5 of cellular factors and plasma proteins cooperate. These 

proteins (and their cof actors) in their entirety are 
called coagulation factors. The final products of the 
coagulation cascade are thrombin, which induces the 
aggregation of blood platelets, and fibrin which stabil- 

10 izes the platelet thrombus. Thrombin catalyzes the 

formation of fibrin from fibrinogen and itself is formed 
by limited proteolysis of prothrombin. Activated factor 
X (factor Xa) is responsible for this step and, in the 
presence of factor Va and calcium ions, binds to platelet 

15 membranes and cleaves prothrombin. 

Two ways exist for factor X to be activated, the extrin- 
sic and the intrinsic pathway. In the intrinsic pathway 
a series of factors is activated by proteolysis in order 
for each of them to form active proteases. In the extrin- 

2 0 sic pathway, there is increased synthesis of thrombo- 

plastin (tissue factor) by damaged cells, and it acti- 
vates factor X, together with factor Vila and calcium 
ions , It was formerly assumed that the activity of 
thromboplastin is confined to this reaction. However, the 

2 5 thromboplastin/VIIa complex also intervenes to activate 

the intrinsic pathway at the level of factor IX. Thus, a 
thromboplastin/VIIa complex is one of the most important 
physiological activators of blood coagulation - 

It is therefore conceiveible that thromboplastin, apart 
30 from its use as diagnostic aid (see below), can also be 

employed as constituent of therapeutic agents for treat- 
ing inborn or acquired blood coagulation deficiencies. 
Examples of this are chronic hemophilias caused by a 
deficiency of factors VIII, IX or XI or else acute 
35 disturbances of blood coagulation as a consequence of, 

for example , liver or kidney disease . Use of such a 
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therapeut^^agent after surgicial intervention would also 
be conceivable. 



Thromboplastin is an integral membrane protein which does 
not belong to the immunoglobulin family. Thromboplastin 
5 cDNA sequences have been published by a total of four 

groups (Fisher et al., Thromb. Res., vol. 4 8 ( 19 87), 
89-99 ; Morrisey et al . , Cell, vol. 50 (1987), 129-135; 
Scarpati et al.. Biochemistry, vol. 26 (1987), 5234-5238; 
Spicer et al , , Proc . Natl. Acad. Sci. USA, vol. 84 

10 (1987 ), 5148-5152). Thromboplastin cDNA contains an open 

reading frame which codes for a polypeptide of 2 95 amino- 
acid residues, of which the 32 N-terminal amino acids act 
as signal peptide. Mature thromboplastin comprises 
2 63 amino-acid residues and has a three-domain structure: 

15 i) amino-terminal extracellular domain (219 amino-acid 

residues); ii) transmembrane region (23 amino-acid 
residues); iii) cytoplasmic domain (carboxyl terminus; 
21 amino-acid residues). In the extracellular domain 
there are three potential sites for N-glycosylation 

2 0 (Asn-X-Thr) . Thromboplastin is normally glycosylated but 

glycosylation does not appear essential for the activity 
of the protein (Paborsky et al., Biochemistry, vol. 29 
( 1989) , 8072-8077) . 



Thromboplastin is required as additive to plasma samples 
in diagnostic tests of coagulation. The coagulation 
status of the tested person can be found by the one-stage 
prothrombin clotting time determination (for example 
Quick's test). The thromboplastin required for diagnostic 
tests is currently obtained from human tissue, and the 
preparation process is difficult to standardize, the 
yield is low and considerable amounts of human starting 
material (placentae) must be supplied. On the other hand, 
it is to be expected that preparation of native, 
membrane-bound thromboplastin by genetic engineering will 
also be difficult owing to complex purification proces- 
ses. These difficulties can be avoided by the fusion 
according to the invention to immunoglobulin portions. 
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The thro^^plastin fusion proteins^^cccrding to the 

invention are Becreted by mammalian cells (for excimple 
CHO, BHK, COS cells) into the culture medium, purified by 
affinity chromatography on protein A-Sepharose and have 
surprisingly high activity in the one-stage prothrombin 
clotting time determination. 



Cloning of thromboplastin cDNA 



The sequence published by Scarpati et al . , Biochemistry, 
vol, 26 ( 1987), 5234-5238, was used for cloning the 
thromboplastin cDNA. Two oligonucleotide probe molecules 
(see Fig. 1) were derived from this. These two probe 
molecules were used to screen a cDNA bank from human 
placenta (Grundmann et al . , Proc . Natl. Acad. Sci. USA, 
vol. 83 (1986), 8024-8028), 



CDNA clones of various lengths were obtained. One clone, 
2b-Apr5, which is used for the subsecfuent procedure, 
codes for the seone amino-acid sequence as the cDNA 
described in Scarpati et al. Fig. 2 depicts the total 
sequence of the clone 2b-Apr5 with the thromboplastin 
amino-acid sequence deduced therefrom. 

ConBtiruction of a hybrid plasaid pTFlFc coding for 
thromboplastin fusion protein • 

The plasmid pCD4E gamma 1 (EP 0 325 2 52 A2; deposited at 
the ATCC under the number No. 6 7610) is used for 
expression of a fusion protein composed of human CD4 
receptor and human IgGl. The DNA sequence coding for the 
extracellular domain of CD4 is deleted from this plasmid 
using the restriction enzymes Hindlll and BamHI . Only 
partial cleavage must be carried out with the enzyme 
Hindlll in this case, in order to cut at only one of the 
two Hindlll sites contained in pCD4E gamma 1 (position 
2198). The result is an opened vector in which a eukary- 
otic transcription regulation sequence (promoter) is 
followed by the open Hindlll site. The open BamHI site is 
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located at the start of the coding regions for a penta- 
peptide linker, followed by the hinge and the CH2 and CH3 
domains of human IgGl. The reading frame in the BamHI 
recognition sequence GGATCC is such that GAT is trans- 
5 lated as aspartic acid. DNA amplification with 

thermostable DNA polymerase makes it possible to modify 
a given sequence in such a way that any desired sequences 
are attached at one or both ends . Two oligonucleotides 
able to hybridize with sequences in the 5 ' -untranslated 
10 region (A: 5' GATCGATTAAGCTTCGGAACCCGCTCGATCTCGCCGCC 3') 

or 

coding region 

( B : 5 ' GCATATCTGGATCCCCGTAGAATATTTCTCTGAATTCCCC 3 ' ) of 
thromboplastin cDNA were synthesized. Of these, oligo- 
15 nucleotide A is partially homologous with the sequence of 

the coding strand, and oligonucleotide B is partially 
homologous with the non-coding strand; cf. Fig, 3. 

Thus, amplification results in a DNA fragment (827 bp) 
which contains (based on the coding strand) at the 5' end 
before the start of the coding sequence a Hindlll site, 
and at the 3' end after the codon for the first three 
amino-acid residues of the transmembrane region a BamHI 
site. The reading frame in the BamHI cleavage site is 
such that ligation with the BamHI site in pCD4E gamma 1 
results in a gene fusion with a reading frame continuous 
from the initiation codon of the thromboplastin cDNA to 
the stop codon of the heavy chain of IgGl. The desired 
fragment was obtained and, after treatment with Hindlll 
and BamHI, ligated into the vector pCD4E gamma 1, as 
described above, which had been cut with Hindlll 
(partially) and BamHI. The resulting plasmid was called 
pTFlFc (Fig* 4 ) . 

Transfection of pTFlPc into mamoalian cells 

The fusion protein encoded by the plasmid pTFlFc is 
35 called pTFlFc hereinafter. pTFlFc was transiently 

expressed in COS cells. For this purpose, COS cells were 



25 
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trans fected with pTFlFc with the aid of DEAE-dextran 
(EP A 0 325 262), Indirect immunofluorescence investiga- 
tions revealed that the proportion of transfected cells 
was about 25 %. 24 h after transf ection, the cells were 
transferred into serum-free medium. This cell supernatant 
was harvested after a further three days . 

Purification of pTFlPc fusion protein from cell culture 
Buperxiatant.s 

170 ml of supernatant from transiently transfected COS 
cells were collected overnight in a batch process in a 
column containing 0.8 ml of protein A-Sepharose at 4°C, 
washed with 10 volumes of washing buffer (50 mM tris 
buffer pH 8.6, 150 mM NaCl ) and eluted in 0.5 ml frac- 
tions with eluting buffer (93:7 100 mM citric acid: 
100 mM sodiiim citrate) . The first 9 fractions were 
immediately neutralized with 0 . 1 ml of 2M tris buffer 
pH 8.6 in each case and then combined, and the resulting 
protein was transferred by three concentration/dilution 
cycles in an Amicon microconcentrator (Centricon 30) into 
TNE buffer (50 mM tris buffer pH 7.4, 50 mM NaCl, 1 mM 
EDTA) . The pTFlFc obtained in this way is pure by 
SDS-PAGE electrophoresis (U.K. Lammli, Nature 227 (1970) 
680-685). In the absence of reducing agents it behaves in 
the SDS-PAGE like a dimer (about 165 KDa). 

25 Biological activity of purified TFlFc in the pirothxambin 

clotting time determi nat^ion 

TFlFc fusion protein is active in low concentrations 
{> 50 ng/ml) in the one-stage prothrombin clotting time 
determination (Vinazzer, H, Gerinnungsphysiologie und 
30 Methoden im Blutgerinnungs labor (1979), Fisher Verlag 

Stuttgart) . The clotting times achieved are comparable 
with the clotting times obtained with thromboplastin 
isolated from human placenta. 



15 
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Example 2: Inter leiiJciii-4 receptor fuBion proteins 

Interleukin-4 (IL-4) is synthesized by T cells and was 
originally called B-cell growth factor because it is able 
to stimulate B-cell proliferation. It exerts a large 
5 number of effects on these cells. One in particular is 

the stimulation of synthesis of molecules of immuno- 
globulin subclasses IgGl and IgE in activated B cells 
(Coffmann et al . , Immunol. Rev., vol. 102 (19B8) 5). In 
addition, IL-4 also regulates the proliferation and 

10 differentiation of T cells and other hemopoietic cells. 

It thus contributes to the regulation of allergic and 
other immunological reactions . IL-4 binds with high 
affinity to a specific receptor. The cDNA which codes for 
the hximan IL-4 receptor has been isolated ( Idzerda et 

15 al., J. Exp. Med., vol. 171 ( 1990) 861-873). It is 

evident from analysis of the amino-acid sequence deduced 
from the cDNA sequence that the IL-4 receptor is composed 
of a total of 825 amino acids, with the 25 N-terminal 
amino acids acting as signal peptide. Mature human IL-4 

20 receptor is composed of 800 amino acids and, like 

thromboplastin, has a three-domain structure: i) amino- 
terminal extracellular domain (207 amino acids); 
ii) transmembrane region (24 amino acids) and iii) 
cytoplasmic domain (569 aimino acids). In the extra- 

25 cellular domain there are six potential sites for 

N-glycosylation ( Asn-X-Thr/Ser ) . IL-4 receptor has 
homologies with human IL-6 receptor, with the ^-subunit 
of human IL-2 receptor, with mouse erythropoietin 
receptor and with rat prolactin receptor (Idzerda et al . , 

30 loc. cit.). Thus, like thromboplastin, it is not a member 

of the immunoglobulin family but is assigned together 
with the homologous proteins mentioned to the new family 
of hematopoietin receptors. Members of this family have 
four cysteine residues and a conserved sequence 

35 (Trp-Ser-X-Trp-Ser ) in the extracellular domain located 

near the transmembrane region in common. 

On the basis of the described function of the IL-4/IL-4 
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receptor ff^tem, there is a poss ible therapeutic use of 
a recombinant form of the IL-4 receptor for suppressing 
IL-4 -mediated immune reactions (for example transplant 
rejection reaction, autoimmune diseases, allergic reac- 
tions ) . 

The amount of substance required for therapy makes it 
necessary to prepare such molecules by genetic 
engineering. Because of the straightforward purification 
by affinity chromatography and improved pharmacokinetic 
properties, according to the invention the synthesis of 
soluble forms of the IL-4 receptor as immunoglobulin 
fusion protein is particularly advantageous. 

The IL-4 receptor fusion proteins are secreted by mamma i — 
ian cells (for example CHO, BHK, COS cells) into the 
culture medium, purified by affinity chromatography on 
protein A-Sepharose and have, surprisingly, identical 
functional properties to the extracellular domain of the 
intact membrane-bound IL-4 receptor molecule. 



Construction of a hybrid plasmid pIL-4RFc coding for IL-4 
receptor fusion protein. 

Cutting of the plasmid pCD4E gamma 1 with Xhol and BamHI 
results in an opened vector in which the open Xhol site 
is located downstream from the promoter sequence. The 
open BamHI site is located at the start of the coding 
regions for a pentapeptide linker, followed by the hinge 
and the CH2 and CH3 domains of human IgGl. The reading 
frame in the BamHI recognition sequence GGATCC is such 
that GAT is translated as aspartic acid. DNA amplifica- 
tion with thermostable DNA polymerase makes it possible 
to modify a given sequence in such a way that any desired 
seqfuences can be attached at one or both ends . Two 
oligonucleotides able to hybridize with sequences in the 
5 ' -untranslated region 

(A: 5' GATCCAGTACTCGAGAGAGAAGCCGGGCGTGGTGGCTCATGC 3 ' ) or 
coding region 
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( B : 5 ' C-iOTGACATGGATCCTGCTCGAAGGGCTCCCTGTAGGAGTTGTG 3 ' ) 
of the IL-4 receptor cDNA which is cloned in the vector 
PDC302/T22.8 (Idzerda et al. , loc . cit. ) were 
synthesized. Of these, oligonucleotide A is partially 
5 homologous with the sequence of the coding strand, and 

oligonucleotide B is partially homologous with the non- 
coding strand; cf. Fig. 5. Amplification using thermo- 
stable DNA polymerase results in a DNA fragment (835 bp) 
which, based on the coding strand, contains at the 5' end 
10 before the start of the coding sequence an Xhol site, and 

at the 3' end before the last codon of the extracellular 
domain a BamHI site. The reading frame in the BamHI 
cleavage site is such that ligation with the BamHI site 
in pCD4E gamma 1 results in a gene fusion with a reading 
15 frame continuous from the initiation codon of the IL-4 

receptor cDNA to the stop codon of the heavy chain of 
IgGl. The desired fragment was obtained and, after 
treatment with Xhol and BamHI, ligated into the vector 
pCD4E gamma 1, described above, which had been cut with 
20 XhoI/BamHI. The resulting plasmid was called pIL4RFc 

(Fig, 6). 



Transfection of pIIi4RFc into mammalian cells 

The fusion protein encoded by the plasmid pIL.4Rrc is 
called pIL4RFc hereinafter, pIL4RFc was transiently 

25 expressed in COS cells. For this purpose, COS cells were 

transfected with pIL4RFc with the aid of DEAE-dextran 
(EP A 0 325 262). Indirect immunofluorescence Investiga- 
tions revealed that the proportion of transfected cells 
was about 25 %. 24 h after transfection, the cells were 

30 transferred into serum-free medium. This cell supernatant 

was harvested after a further three days . 



Purification of IL4RFc fusion protein from cell culture 
Bupematants 



500 ml of supernatant from transiently transfected COS 




cells were collected overnight in a batch process in a 
column containing 1.6 ml of protein A-Sepharoee at 4*C, 
washed with 10 volumes of washing buffer (50 mM tris 
buffer pH 8.6, 150 mM NaCl) and eluted in 0 . 5 ml frac- 
tions with eluting buffer (93 : 7 100 mM citric acid: 
100 mM sodium citrate) . The first 9 fractions were 
immediately neutralized with 0.1 ml of 2M tris buffer 
pH 8.6 in each case and then combined, and the resulting 
protein was transferred by three concentration/dilution 
cycles in an Amicon microconcentrator (Centricon 30) into 
TNE buffer (50 mM tris buffer pH 7,4, 50 mM NaCl, 1 mM 
EDTA) . The IL4RFc obtained in this way is pure by 
SDS-PAGE electrophoresis (U.K. Lammli, Nature 227 (1970) 
680-685). In the absence of reducing agents it behaves in 
the SDS-PAGE like a dimer (about 150 KDa). 

Biological activity of purified IIi4RFc 

IL4RFC proteins binds ^^^I-radiolabeled IL-4 with the same 
affinity (Kd=0.5 nM) as membrane-bound intact IL.-4 recep- 
tor. It inhibits the proliferation of IL-4 -dependent cell 
line CTLLHuIL-4RI clone D (Idzerda et al • , loc . cit.) in 
concentrations of 10-1000 ng/ml . In addition, it is 
outstandingly suitable for developing IL-4 binding assays 
because it can be bound via its Fc part to microtiter 
plates previously coated with, for example, rabbit anti- 
human IgG, and in this form likewise binds its ligands 
with high affinity. 

Example 3: Erythropoietin fusion proteins 

Mature erythropoietin (EPO) is a glycoprotein which is 
composed of 166 amino acids and is essential for the 
development of erythrocytes. It stimulates the maturation 
and the terminal differentiation of erythroid precursor 
cells. The cDNA for human EPO has been cloned 
(EP-A-0 267 678) and codes for the 166 amino acids of 
mature EPO and a signal peptide of 22 amino acids which 
is essential for secretion. The cDNA can be used to 
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prepare recombinant functional EPO in genetically 
manipulated memnnalian cells and the EPO can be employed 
clinically for the therapy of anemic manifestations of 
various etiologies (for example associated with acute 
5 renal failure) . 

Because of the straightforward purification and the 
Improved pharmacokinetic properties, according to the 
invention synthesis of EPO as Immunoglobulin fusion 
protein is particularly advantageous. 

10 Construction of a hybrid plasmld pEPOFc coding for 

erythropoietin fusion protein. 



This construction was carried out in analogy to that 
described in Example 2 (section: "Construction of a 
hybrid plasmid pIL-4RFc coding for IL-4 receptor fusion 
15 protein"). Two oligonucleotides ahle to hybridize with 

sequences in the vicinity of the initiation codon 
(A: 5 'GATCGATCTCGAGATGGGGGTGCACGAATGTCCTGCCTGGCTGTGG 3 ' ) 
and of the stop codon 

( B : 5 ' CTGGAATCGGATCCCCTGTCCTGCAGGCCTCCCCTGTGTACAGC 3 ' ) 

2 0 of the EPO cDNA cloned in the vector pCES (EP-A 0 2 67 

678) were synthesized. Of these, oligonucleotide A is 
partially homologous with the sequence of the coding 
strand, and oligonucleotide B is partially homologous 
with the non-coding strand; cf. Fig. 7, Amplification 

2 5 with thermostable DNA polymerase results in a DNA frag- 

ment (598 bp) which, based on the coding strand, contains 
at the 5 ' end in front of the initiation codon an Xhol 
site and in which at the 3' end the codon for the 
penultimate C-terminal amino acid residue of the EPO 

30 (Asp) is present in a BamHI recognition sequence. The 

reading frame in the BamHI cleavage site is such that 
ligation with the BamHI site in pCD4E gamma 1 r-esults in 
a gene fusion with a reading frame continuous from the 
initiation codon of EPO cDNA to the stop codon of the 

35 heavy chain of IgGl. The desired fragment was obtained 

and, after treatment with Xhol and BamHI, ligated into 
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the vector pCD4E gamma 1, described above, which had been 
cut with XhoI/BamHI. The resulting plasmid was called 
pEPOFc (Fig. 8 ) . 



IE 912256 



- 15 - 



HOE 90/B 026 





Patent riaima* 



1 . 



A soluble fusion protein composed of human proteins 



10 



15 



20 



25 



not belonging to the immunoglobulin family, or of 
parts thereof, and of various portions of immuno- 
globulin molecules of all subclasses. 

2- A fusion protein as claimed in claim 1, wherein the 
immunoglobulin portion is the constant part of the 
heavy chain of human IgG. 

3. A fusion protein as claimed in claim 2, wherein the 
immunoglobulin portion is the constant part of the 
heavy chain of human IgGl or a protein A-binding 
fragment thereof. 

4. A fusion protein as claimed in claim 2 or claim 3, 
wherein the fusion takes place at the hinge region. 

5. A fusion protein as cladLmed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of a membrane protein or parts 
thereof . 

6. A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of thromboplastin or parts thereof. 

7. A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of a cytokine receptor or growth 
factor receptor or parts thereof. 

8. A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of IL-4 receptor or parts thereof . 

9. A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of IL-7 receptor or parts thereof . 




A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of tumor necrosis factor receptor or parts 
thereof • 

A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of G-CSF receptor or parts thereof. 

A fusion protein as claimed in claim 1 , wherein the 
protein fused to Immunoglobulin is the extracellular 
portion of GM-CSF receptor or parts thereof, 

A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of erythropoietin receptor or parts thereof. 

A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is a non- 
membrane-bound soluble protein or part thereof . 

A fusion protein as claimed in claim 14, wherein the 
protein fused to immunoglobulin is a cytokine or 
growth factor or part thereof. 

A fusion protein as claimed in claim 15, wherein the 
protein fused to immunoglobulin is erythropoietin or 
part thereof. 

A fusion protein as claimed in claim 15, wherein the 
protein fused to immunoglobulin is GM-CSF or G-CSF 
or part thereof. 

A fusion protein as claimed in claim 15, wherein the 
protein fused to immunoglobulin is interleukin IL-1 
to IL-8 or part thereof. 
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19. A fuTion protein as claiined in any of preceding 
claims 1-18, wherein a factor Xa cleavage site is 
additionally inserted between the intmunoglobulin 
part and the non- immunoglobulin part. 

20. A process for preparing fusion proteins as claimed 
in any of claims 1-19, which comprises introducing 
the DNA coding for these constructs into a mammalian 
cell expression system and, after expression, 
purifying the produced fusion protein by affinity 
chromatography via the immunoglobulin portion. 

21. The use of the fusion proteins as claimed in any of 
claims 1-19 for diagnosis. 

22. The use of the fusion proteins as claimed in any of 
claims 1-19 for therapy. 

23. A fusion protein as claimed in claim 1, substantially 
as hereinbefore described and exemplified. 

24. A process for preparing a fusion protein as claimed in 
claim 1, substantially as hereinbefore described and 
exempl if ied . 

25. A fusion protein as claimed in claim 1, whenever prepai 
by a process claimed in claim 20 or 24. 

Dated this the 27th day of June, 1991 
F. R. KELLY & CO. 
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^- v> :\ r- \^ f\rt i x .\ 

1/11 



GTCGCTCGGACGCTCCTGCTCGGCTGGGTCTTCGCCCAGGTGGCCGGCGCnCAGGCACT 
121 -- + - + + + - + + ISO 

CAGCGAGCCTGCGAGGACGAGCCGACCCAGAAGCGGGTCCACCGGCCGCGAAGTCCGTGA 

Oligonucleotide 1 

ACAMTACTGTGGCAGCATATAATTTAACnGGAMTCAACTAATTTCMGACAATTTTG 
181 ^. + + ^ ^ 240 

TGTTTATGACACCGTCGTATATTAAATTGAACCTTTAGTTGATTAAAGTTCTGTTAAAAC 



Oligonucleotide 2 

AACTACTGTTTCAGTGTTCAAGCAGTGATTCCCTCCCGAACAGTTAACCGGAAGAGTACA 
721 ^ . + + ^ ^ 730 

TTGATGACAAAGTCACAAGTTCGTCACTAAGGGAGGGCTTGTCAATTGGCCTTCTCATGT 



IE 912256 

BEHR:NGWERKE AKTIENGE^|fc.SCAHFT 11 sheets - TRUE COPY 

CORPORATION 2/11 



10 30 50 

GCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTGATATCGAATTCTCTCGGCG/iAr.CCC 

70 90 110 

CTCGCACTCCCTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCCAGCCCCACGGGC 

130 150 170 

GCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACCCCTGCCTGGCCC 

MetGI uThrProAl aTrpPro 

190 210 230 

CGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGCTGGGTCTTCGCC 
ArgVal ProArgProGl uThrAl aVal Al aArgThrLeuLeuLeuGlyTrpVal PheAl a 

250 270 290 

CAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAATTTAACTTGGAAA 
Gl nVal Al aGlyAI aSerGl^ThrThrAsnThrVal Al aAl aTyrAsnLeuThrTrpLys 

310 330 350 

TCAACTAATTTCAAGACAATTTTGGAGTGGGAACCCAAACCCGTCAATCAAGTCTACACT 
SerThrAsnPheLysThrlleLeuGl uTrpGl uProLysProVal AsnGl nVal TyrThr 

370 390 410 

GTTCAAATAAGCACTAAGTCAGGAGATTGGAAAAGCAAATGCTTTTACACAACAGACACA 
Val GlnlleSerThrLysSerGlyAspTrpLysSerLysCysPheTyrThrThrAspThr 

430 450 470 

GAGTGTGACCTCACCGACGAGATTGTGAAGGATGTGAAGCAGACGTACTTGGCACGGGTC 
Gl uCysAspLeuThrAspGl u IleVal LysAspVal LysGl nThrTyrLeuAl aArgVal 

490 510 530 

TTCTCCTACCCGGCAGGGAATGTGGAGAGCACCGGTTCTGCTGGGGAGCCTCTGTATGAG 
PheSerTyrProAl aGlyAsnValGl uSerThrGlySerAl aGlyGl uProLeuTyrGI u 

550 570 590 

AACTCCCCAGAGTTCACACCTTACCTGGAGACAAACCTCGGACAGCCAACAATTCAGAGT 
AsnSerProGl uPheThrProTyrLeuGl uThrAsnLeuGlyGlnProThrlleGlnSer 



IE 912256 

BEHRINGWERKE AKTIENGE^P^SCAHFT 11 sheets - TRUE ^0:=v 

and THE GENERAL HOSPITAL sheet 3 - - - 

CORPORATION 3/11 



M^igj : 2 (cont) 



610 630 650 

TTTGAACAGGTGGGAACAAAAGTGAATGTGACCGTAGAAGATGAACGGACTTTAGTCAGA 
PheGluGlnValGlyThrLysValAsnVaUhrValGluAspGluArgThrLeuValArg 

670 690 710 

AGGAACAACACTTTCCTAAGCCTCCGGGATGTTTTTGGCAAGGACTTAATTTATACACTT 
ArgAsnAsnThrPheLeuSerLeuArgAspValPheGlyLysAspLeuIleTyrThrLeu 

730 750 770 

TATTATTGGAAATCTTCAAGnCAGGAAAGAAAACAGCCAAAACAAACACTAATGAGTTT 
TyrTyrTrpLysSerSerSerSerGlyLysLysThrAlaLysThrAsnThrAsnGluPhe 

790 810 830 

TTGATTGATGTGGATAAAGGAGAAAACTACTGTTTCAGTGTTCAAGCAGTGATTCCCTCC 
LeuIleAspValAspLysGlyGluAsnTyrCysPheSerValGlnAlaVal IleProSer 

850 870 890 

CGAACAGTTAACCGGAAGAGTACAGACAGCCCGGTAGAGTGTATGGGCCAGGAGAAAGGG 
ArgThrVal AsnArgLysSerThrAspSerProVal Gl uCysMetGlyGl nGI uLysGly 

910 930 950 

GAATTCAGAGAAATATTCTACATCATTGGAGCTGTGGTATTTGTGGTCATCATCCTTGTC 
GluPheArgGluIlePheTyrneneGlyAlaValValPheValVal IlelleLeuVal 

970 990 1010 

ATCATCCTGGCTATATCTCTACACAAGTGTAGAAAGGCAGGAGTGGGGCAGAGCTGGAAG 
nelleLeuAl alleSerLeuHisLysCysArgLysAl aGlyVal GlyGl nSerTrpLys 

1030 1050 1070 

GAGAACTCCCCACTGAATGTTTCATAAAGGAAGCACTGTTGGAGCTACTGCAAATGCTAT 
Gl uAsnSerProLeuAsnVal Ser 

1090 1110 1130 

ATTGCACTGTGACCGAGAACTTTTAAGAGGATAGAATACATGGAAACGCAAATGAGTATT 

1150 1170 1190 

TCGGAGCATGAAGACCCTGGAGTTCAAAAAACTCTTGATATGACCTGTTATTACCATTA3 



F.R. Kellv 



IE 912256 

BEHRINGWERKE AKT I ENGE^^,SCAK FT 11 sheets - TRUE COPY 

and THE GENERAL HOSPIl^T sheet 4 
CORPQKATICN l^/]] 



- 2 (cont.) 



1210 1230 1250 

CATTCTGGTTTTGACATCAGCATTAGTCACTTTGAAATGTAACGAATGGTACTACAACCA 

1270 1290 1310 

ATTCCAAGTTTTAATTTTTAACACCATGGCACCTTTTGCACATAACATGCTTTAGATTAT 

1330 1350 1370 

ATATTCCGCACTTAAGGATTAACCAGGTCGTCCAAGCAAAAACAAATGGGAAAATGTCTT 

1390 1410 1430 

AAAAAATCCTGGGTGGACTTTTGAAAAGCTTTTTTTTTTTTTTTTTTTTGAGACGGAGTC 

1450 1470 1490 

TTGCTCTGTTGCCCAGGCTGGAGTGCAGTAGCACGATCTCGGCTCACTTGCACCCTCCGT 

1510 1530 1550 

CTCTCGGGTTCAAGCAATTGTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCGC 

1570 1590 1610 

ACTACCACGCCAAGCTAATTTTTGTATTTTTTAGTAGAGATGGGGTTTCACCATCTTGGC 

1630 1650 1670 

CAGGCTGGTCTT6AATTCCTGACCTCAGTGATCCACCCACCTTGGCCTCCCAAAGATGCT 

1690 1710 1730 

AGTATTATGGGCGTGAACCACCATGCCCAGCCGAAAAGCTTTTGAGGGGCTGACTTCAAT 

1750 1770 1790 

CCATGTAGGAAAGTAMATGGAAGGAAATTGGGTGCATTTCTAGGACTTTTCTAACATAT 

1810 1830 1850 

GTCTATAATATAGTGTTTAGGTTCTTTI i I I I I i CAGGAATACATTTGGAAATTCAAAAC 

1870 1890 1910 

AATTGGGCAAACTTTGTATTAATGTGTTAAGTGCAQGAGACATTGGTATTCTGGGCAGCT 



IE 912256 



BEHRINGWERKE AKT lENGES^^SCAHFT 11 sheets - TRUE C 

and THE GENERAL HOSPITAL sheet 5 

CORPORATION 5/]] 



2 (cont) 



1930 1950 1970 

TCCTAATATGCTTTACAATCTGCACTTTAACTGACTTAAGTGGCATTAAACATTTGAGAG 

1990 2010 2030 

CTAACTATATTTTTATAAGACTACTATACAAACTACAGAGTTTATGATTTAAGGTACTTA 

2050 2070 2090 

AAGCTTCTATGGnGACATTGTATATATAAI I I 1 ITAAAAAGGTTTTTCTATATGGGGAT 

2110 2130 2150 

TTTCTATTTATGTAGGTAATATTGTTCTATTTGTATATATTGAGATAATTTATTTAATAT 



2170 

ACTTTAAATAAAGGTGACTGGGAATTGTT 



IE 912256 



BEHRINGKERKE AKTIENGE^fc SCAHFT 11 sheets - TRUE COPY 

and THE GENERAL HOSPI'lT^ sheet 6 

CORPORATION g^l^ 



Hindlll 

5' GATCGATTAAGCTTCGGAACCCGCTCGATCTCGCCGCC 3' Oligonucleotide A 

Illlllllllllllllllllllll 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAG 

110 + + + + + 167 

TCGGGGTGCCCGCGGTGCCTTGGGCGAGCTAGAGCGGCGGTTGACCATCTGTACCTC 



MetGlu 



-untranslated Start 

Reading frame 
(signal peptide) 



End of extracellular domain | Start of transmembrane region 



GlnGluLysGlyGluPheArgGluIlePheTyrneneGlyAlaVal 
CAGGAGAAAGGGGAATTCAGAGAAATATTCTACATCATTGGAGCTGTGGT 

890 + + + + + 940 

GTCCTCTTTCCCCTTAAGTCTCTTTATAAGATGTAGTAACCTCGACACCA 

IIIIIIIMMIIIIIIIIIIIII 
3' CCCCTTAAGTCTCTTTATAAGATGCCCCTAGGTCTATACG 5 ' Oligonucleotide B 



BarTiHI 



C D Voll. S. ( 
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3000 



IE 912256 

BEHRINGWERKE AKT I ENGE^^.S CAHFT 11 sheets - TRUE COPY 

ctiid .HE GENERAL HOSrl'SBF sheer 8 



; h e e r 8 
:OR PORA T: ON 

8/n 



Xhol 

5' GATCCAGTACTCGAGAGAGAAGCCGGGCGTGGTGGCTCATGC 3' Oligonucleotide A 

i M 1 1 1 ! i 1 1 1 1 M 1 : 1 i M 1 1 1 1 1 1 1 

AGAGAAGCCGGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTTGGAGGCTGAGGCGG 

+ - + + + 12C 

TCTCTTCGGCCCGCACCACCGAGTACGGATATTAGGGTCGTGAAAACCTCCGACTCCGCC 



5 ' -untranslated 



GCAGATCACTTGAGATCAGGAGTTCGAGACCAGCCT6GTGCCTTGGCATCTCCCAATGGG 

121 ^ + + + + + 180 

CGTCTAGTGAACTCTAGTCCTCAAGCTCTGGTCGGACCACGGAACCGTAGAGGGTTACCC 



-5 ' -untranslated [MetGly 



Start 

Reading fraine (signal peptide) 



End of extracellular domain | Start of transmembrane region 

I 

Hi sAsnSerTyrArgGl uProPheGl uGl nHI s Le uLeuLeuGl yVal SerVal SerCys 
CACAACTCCTACAGGGAGCCCTTCGAGCAGCACCTCCTGCTGGGCGTCAGCGTTTCCTGC 
639 ^ ^ + - + ^ - 89B 

GTGTTGAGGATGTCCCTCGGGAAGCTCGTCGTGGAGGACGACCCGCAGTCGCAAAGGACG 

iii!iii!;iMili!ii:iM;;:Nlll 

3' GTGTTGAGGATG'CCCTCGGGAAGCTCGTCCTAGGTACAGTATC 5' Oligonucleotide B 



IE 912256 

BEHRINGWERKE AKTIENGE|^^SCAHFT 11 sheets - TRL'E CC 

ar.d THE GENERAL HOSPlI^^ sheet 9 



COKPORATICN 



9/n 




IE 912256 



BEHRI.NGV.-ERKE AKT I ENGE^^ SCAH FT 11 sheets - -;^':tr -Qpv 

and THE GENERAL HCSPI^Ir shppt 10 

CORPORATION 



Xhol 

5' GATCGATCTCGAGATGGGGGTGCACGAATGTCCTGCCTGGCTGTGG 3' Oligonucleotide A 

llllllllilllllllMIIIIIIIIIIIIIII 

ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCTGTCG 

382 - + + + + 235 

TACCCCCACGTGCTTACAGGACGGACCGACACCGAAGAGGACAGGGACGACAGC 

HetGlyVal Hi sGl uCysProAl aTrpLeuTrpLeuLeuLeuSerLeuLeuSer - 

Start reading frame (signal peptide) 



End of reading frame - | 

LeuTyrThrGlyGl uAl aCysArgThrGlyAspArgEnd 

I 

GCTGTACACAGGGGAGGCCTGCAGGACAGGGGACAGATGACCAGGT6TGTCCACCTGGGC 
724 + + + 4. + - + 783 

CGACATGTGTCCCCTCCGGACGTCCTGTCCCCTGTCTACTGGTCCACACAGGTGGACCCG 

IlillMMIIMIIMIMIIilllllllMI 

3' CGACATGTGTCCCCTCCGGACGTCCTGTCCCCTAGGCTAAGGTC 5' Oligonucleotide B 

BanHI 
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